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Nonstructural protein 14 (nsp14) of coronaviruses (CoV) is important for 
viral replication and transcription. The N-terminal exoribonudease 
(ExoN) domain plays a proofreading role for prevention of lethal 
mutagenesis, and the C-terminal domain functions as a (guanine-N7) 
methyl transferase (N7-MTase) for mRNA capping. The molecular basis 
of both these functions is unknown. Here, we describe crystal 
structures of severe acute respiratory syndrome (SARS)-CoV nsp14 in 
complex with its activator nonstructural proteinlO (nspIO) and func¬ 
tional ligands. One molecule of nspIO interacts with ExoN of nsp14 to 
stabilize it and stimulate its activity. Although the catalytic core of 
nsp14 ExoN is reminiscent of proofreading exonudeases, the presence 
of two zinc fingers sets it apart from homologs. Mutagenesis studies 
indicate that both these zinc fingers are essential for the function of 
nsp14. We show that a DEEDh (the five catalytic amino acids) motif 
drives nucleotide excision. The N7-MTase domain exhibits a noncanon- 
ical MTase fold with a rare (3-sheet insertion and a peripheral zinc 
finger. The cap-precursor guanosine-P3-adenosine-5,5'-triphosphate 
and S-adenosyl methionine bind in proximity in a highly constricted 
pocket between two p-sheets to accomplish methyl transfer. Our stud¬ 
ies provide the first glimpses, to our knowledge, into the architecture 
of the nsp14-nsp10 complex involved in RNA viral proofreading. 
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C oronaviruses (CoV), belonging to the Coronaviridae family in 
the order Nidovirales (1), are one of the major threats to public 
health. The most notable infections are the severe acute respiratory 
syndrome (SARS) and Middle East respiratory syndrome (MERS) 
caused by the SARS-CoV and MERS-CoV, respectively (2, 3). 
SARS-CoV brought about more than 8,000 infections and 800 
deaths, and MERS-CoV has caused 1,139 cases of infections and 
431 deaths till May 25, 2015 (www.who.int/en/). 

Co Vs have the largest genomes among RNA viruses (4). There 
are 14 ORFs in the genome of SARS-CoV. Among these, ORFla 
and ORFlb encode 16 nonstructural proteins (nsp) that pre¬ 
dominantly play a role in replication and transcription (5). Within 
these nsps, nsp 12 functions as a RNA-dependent RNA polymerase 
(RdRp), and nsp8 together with nsp7 functions as a primase and 
confers processivity to polymerization by nsp 12 (6-8). More im¬ 
portantly, the nsp7-nsp8-nspl2 complex can associate with nspl4 
without impacting RNA synthesis (8). This interaction is crucial, 
because nsp 14 has been shown to play a pivotal role in decreasing 
the incidence of mismatched nucleotides through its exoribonu- 
clease domain (ExoN) (9-11), a role akin to a proofreading ExoN 
associated with a polymerase. Abrogation of the nspl4 ExoN ac¬ 
tivity results in enhanced sensitivity to the RNA mutagen 5-fluo- 
rouracil (12,13). The nspl4-nspl0 complex can exquisitely excise 3' 
mismatched nucleotides from dsRNA (14). Disturbance of the in¬ 
teraction between nspl4 and nspIO has been shown to result in a 
decrease in replication fidelity (15). Consequently, in contrast to the 
general replication fidelity of RNA viruses (10 _3 -10 -5 ), the low 
mutation rate (lO^-lO -7 ) of SARS-CoV is tied to the ExoN activity. 

A DEDDh (the five catalytic amino acids) motif drives catalysis by 


nspl4 that is important for the viral replication and transcription 
(14, 16). In vivo studies using mouse models have demonstrated a 
role for this ExoN activity in viral virulence and pathogenesis (17). 
Nspl4 is highly conserved within the Coronaviridae family. In- 
triguingly, ExoNs also are encoded by RNA viruses belonging to the 
order Nidovirales with genomes larger than 20 kb (4, 18, 19). 

In addition, nspl4 is also known to function as an S-adenosyl 
methionine (SAM)-dependent (guanine-N7) methyl transferase 
(N7-MTase) (20). Assembly of a capl structure at the 5' end of viral 
mRNA assists in translation and evading host defense (21-23). 
Formation of this cap in SARS-CoV requires four sequential re¬ 
actions. First, nsp 13 RNA triphosphatase (RTPase) hydrolyzes 
nascent RNA to yield pp-RNA (24). Then an unknown guanylyl- 
transferase (GTase) hydrolyzes GTP, transfers the product GMP to 
pp-RNA, and creates Gppp-RNA. Then nspl4 methylates the 5' 
guanine of the Gppp-RNA at the N7 position, followed by meth- 
ylation of the ribose of the first nucleotide at the 2-0 position by 
nspl6 (20, 25). NspIO has been shown to activate the 2 -O-MTase 
activity of nsp 16 by stabilizing the SAM-binding pocket and 
extending the substrate RNA-binding groove of nsp16 (26, 27). 
Similarly, the ExoN activity of nspl4 is fully unleashed only in the 
presence of nspIO (14). However, the molecular basis for this ac¬ 
tivation is poorly understood. 


Significance 

Proofreading exonudeases contributing to replication fidelity in 
DNA viruses and cellular organisms are well known; however, 
proofreading in RNA viruses was unknown until recently. Corona- 
virus nonstructural protein 14 (nsp14) has been shown to function 
as a proofreading exoribonudease. Additionally, nsp14 shows 
(guanine-N7) methyl transferase activity for viral mRNA capping. 
Both roles are important for viral replication and transcription. Here, 
we report the structures of severe acute respiratory syndrome- 
coronavirus nsp14 in complex with its activator nonstructural pro¬ 
tein 10 (nspIO) and functional ligands. Structural observations 
coupled with mutagenesis and functional assays provide a better 
understanding of the function of nsp14. Furthermore, the structures 
of the nsp14-nsp10 complex demonstrate several unique niches 
that could be targeted for development of potent antiviral drugs. 
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Fig. 1. Overall structure of the nsp14-nsp10 complex. (A) Domain organization 
of nsp14 and nspIO. Domain boundaries are marked with residue numbers. 
(B) Cartoon representation of the structure of the nsp14-nsp10 heterodimer. 
NspIO, the ExoN domain, the N7-MTase domain, and the loop at the N terminus 
of N7-MTase are marked red, green, marine, and pink, respectively. Invisible 
residues from 454 to 464 of nsp14 are shown by a dashed line. Magnesium, zinc 
ions, and the N7-MTase substrates SAH and GpppA are shown as spheres and are 
colored magenta, gray, yellow, and light pink, respectively. Three zinc fingers 
(ZF) of nsp14 are highlighted with residues shown as sticks. 


Here, we report the nspl4-nspl0 complex structures and show 
how nspl4 is activated by nspIO. Structural observations coupled 
with mutagenesis and functional assays unveil several previously 
unknown features of nspl4 and provide a better understanding of 
its participation in proofreading and mRNA capping. 

Results and Discussion 

Overall Structure of the nsp14-nsp10 Complex. Full-length nspl4 
was coexpressed with nspIO in Escherichia coli and purified as a 
preformed complex (Fig. L4 and Fig. SI). The unliganded, SAM- 
bound, and S-adenosyl homocysteine (SAH)-guanosine-P3-adeno- 
sine-5',5'-triphosphate (GpppA)^bound nspl4-nspl0 complex struc¬ 
tures were refined to 3.4 A, 3.2 A, and 3.3 A resolutions, respectively 
(Table SI). With the exception of the region encompassing amino 
acids 454-464 of nspl4 and the C terminus of nspIO (residues 132- 
139), all residues of nspl4 and nspIO could be built in the final model 
(Fig. IB). The complex structure reveals that one molecule of nspIO 
binds one molecule of nspl4 (Fig. IB). Furthermore, nspl4 is 
bimodular; amino acids 1-287 fold into the ExoN domain, and amino 
acids 288-527 form the N7-MTase domain. A convoluted loop 
consists of amino acids 288-301, and a break in it could result in 
abolishment of the N7-MTase activity (28). Electron density maps of 
the two active centers are shown in Fig. S2. NspIO interacts exclu¬ 
sively with the ExoN domain of nspl4 (Fig. IB), as is consistent with 
previous biochemical results showing that nspIO stimulates the ExoN 
activity without perturbing the N7-MTase activity (14, 25). 

Structure of the nsp14 ExoN Domain. The ExoN domain contains a 
central, twisted p-sheet made up of five p-strands (Fig. 24). The 
strands form a parallel p-sheet with the exception of p3 and are 
flanked by a-helices on either side. Such an arrangement of the core 
structural elements is reminiscent of the structures of the DEDD 
superfamily exonucleases, such as the s subunit of E. coli DNA 


polymerase III [Protein Data Bank (PDB) code 1J53; Z score of 
11.7, rmsd of 3.0 A over 174 aligned Ca atoms as calculated by 
DALI server (Fig. 2 B)\ (29). One Mg 2+ ion is observed at its active 
center (Fig. 24). However, three major structural differences set 
nspl4 apart from typical DEDD family exonucleases (Fig. 24). 
First, the N-terminal Alal-Arg76 forms a long, flexible region that 
interacts with nspIO. Second, a p-hairpin structure containing p5 
and p6 (Alall9-Aspl45) also is observed to interact with nspIO. 
Last, the most striking difference is the presence of a zinc finger on 
either side of the p-sheet. The first zinc finger, comprising of 
Cys207, Cys210, Cys226, and His229, is located between a4 and 
plO, and the second zinc finger comprising, His257, Cys261, His264, 
and Cys279, is located between a5 and a6 (Figs. IB and 24). 

Comparison of nsp14 ExoN Active Sites with Proofreading Homologs. 

Although the overall structure of the nspl4 ExoN domain has di¬ 
verged significantly from other proteins, the architecture of the 
catalytic core and active sites resembles those used by DEDD-type 
exonucleases, such as the proofreading ExoN domain of DNA 
polymerase I Klenow fragment (PDB ID code 1KLN) and the s 
subunit of DNA polymerase III (PDB ID code 1J53) of E. coli , 
suggesting a conserved mechanism for catalysis. These proteins and 
nspl4 share a similar two-metal-ion-assisted mechanism for re¬ 
moval of misincorporated nucleotides (30-32). In the nsplA-nsplO 
complex structure, only one Mg 2+ coordinated by Asp90 and 
Glul91 is observed. The absence of the second metal ion could be 
attributed to the lack of the substrate or product binding (30). In¬ 
gredients such as the 3' end of DNA, metal ions, and the solvent 
molecule required for catalysis are positioned in place by side chains 
of D355, E357, D424, Y497, and D501 of the Klenow fragment 
(Fig. 2C) (33). The ExoN domain of nspl4 exhibits a similar con¬ 
stellation of acidic amino acids, with one notable difference. A 
conserved aspartate of motif II (34, 35), for instance Asp424 of the 
Klenow fragment and Asp 103 of the s subunit of polymerase III, is 
replaced by Glul91 in nspl4 (Fig. 2C and Fig. S3) (31, 33). In 
addition, His268 categorizes nspl4 as a DEDDh-type exoribonu- 
clease. The position of these catalytic amino acids around a nu¬ 
cleotide modeled in the active site of ExoN is shown in Fig. 2C. The 
Mg A activates one molecule of water to initiate the nucleophilic 
attack on the phosphorous of the substrate, whereas Mg B facilitates 
the leaving of the product (Fig. 2C). Simultaneously mutating 
Asp90 and Glu92 to alanine impaired the ExoN activity signifi¬ 
cantly, whereas E191A, H268A, or D273A mutants were severely 
deficient in their ability to degrade RNA, confirming the impor¬ 
tance of these amino acids in the excision of nucleotides (Fig. 2D). 
Intriguingly, Asp243, previously mistaken as a catalytic residue (5, 
16), is the fifth highly conserved acidic amino acid located between 
motifs II and III (Fig. 2C and Fig. S3) (34). The ExoN activity of 
D243A mutant is completely lost (Fig. 2D). In contrast to reported 
results, the activity of D90A/E92A, D243A, H268A, and D273A 
complexed with nspIO or not on different substrates has been shown 
to be nearly identical (14, 16, 20). 

Further, we mutated residues of the two zinc fingers of the ExoN 
domain of nspl4 to decipher their functions. Zinc finger 2 is in 
proximity to the catalytic residues. Disruption of this zinc finger via a 
C261A or H264R mutation abolished the enzymatic activity, sug¬ 
gesting this zinc finger has a role in catalysis (Fig. 2D). In stark 
contrast to mutants of zinc finger 2, none of the mutants of zinc finger 
1 could be expressed as soluble proteins. Inspection of the structure 
reveals that zinc finger 1 contributes the structural stability of nspl4. 

Thus, Asp90, Glu92, Glul91, His268, and Asp273 of nspl4 are 
likely to provide the electrophilic environment necessary for li¬ 
gand binding and catalysis. In addition, both the zinc fingers of 
the ExoN domain are essential for the function of nspl4. 

Stimulation of ExoN Activity of nsp14 by nspIO. In the absence of 
nspIO, nspl4 cannot catalyze nucleotide excision efficiently (Fig. 2D). 
The structure of the nsplA-nsplO complex explains this requirement 
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Fig. 2. Comparison of the structure and catalytic residues of nsp14 ExoN domain with proofreading homologs. (A) Cartoon representation of the ExoN domain marked 
with secondary structural elements. The three different regions from other DEDD superfamily exonudeases are indicated by red dashed ellipses. ( B) The structure of the 
E. coli s subunit of polymerase III (Pol III) is shown in the same orientation as nsp14for comparison. Metal ions are shown as spheres, and bound ligands are shown as sticks. 
(C) The active center of the ExoN domain of nsp14. (Upper) Catalytic residues of the ExoN domain of nsp14, the exonuclease domain of DNA polymerase I, and the s 
subunit of DNA polymerase III of E. coli are listed in the table. (Lower) Catalytic residues, the modeled substrate AMP, and the mistaken D243 are shown as sticks. Mg B 
observed in the structure and Mg A modeled are shown as spheres. Dashed lines indicate the hydrogen bonds between Mg B and D90 and El91. (D) Exoribonuclease assays 
for nspIO, nsp14 alone, and nsp14 or nsp14 mutants in complex with nspIO on 5'-labeled ssRNA of 22 nucleosides (RNA22). The symbol "#" indicates cleavage products. 


of nspIO for enzymatic activity of nspl4. Known structures of 
nspIO could be superimposed over the nspl4-nspl0 complex 
with an rmsd of <0.8 A between matching Cot atoms over the 
entire length of nspIO. Two regions of nspIO contribute the 
major residues for nspl4-nspl0 interaction (Fig. 3 A). The first 
contact area involves the entire N-terminal loop and helix al 
(Prol-Leu24) of nspIO, which has led to interpretable electron 
density for these first nine residues of nspIO that were not ob¬ 
served in previous structures (26, 27, 36, 37). The residues Alai, 
Asn3, and Glu6 of nspIO stabilize the N terminus of nspl4 
by forming hydrogen bonds with Lys9, AsplO, and Thr5, re¬ 
spectively, whereas Phel6, Phel9, and Val21 of nspIO form van 
der Waals interactions with Phe60, Met62, and Tyr64 of nspl4 
(Fig. 3 B and D). The second region of intermolecular interactions is 
extensive and encompasses residues from the loop region following 


helix a2 and residues around zinc finger 1. Here, a number of com¬ 
plementary hydrogen bonds are observed; Asn40, Lys43, Leu45, 
Thr58, Ser72, Lys93, and Tyr96 of nspIO interact with Thr25, His26, 
Cys39, Asp41, Ala23, Tyr51, and Hisl9 of the N terminus of nspl4. A 
salt bridge formed between His80 of nspIO and Aspl26 of nspl4 and 
a hydrogen bond between Cys90 of nspIO and Asnl29 of nspl4 
stabilize the structural elements between (35 and (36 of nspl4 (Fig. 3 C 
and D). The extensive interaction of nspIO with nspl4 suggests that 
nspIO might be necessary to maintain the structural stability of the 
ExoN domain and fully unleash the ExoN activity of nspl4 (14). 

Previous mutagenesis studies had shown that regions of nspIO 
contacting nspl4 and nspl6 overlap substantially (38, 39). 
Comparison of the nspl4-nspl0 complex with the nspl6-nspl0 
complex surprisingly reveals that a significantly larger surface area 
of nspIO contacts with nspl4 (Fig. S4). The buried solvent- 
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Fig. 3. Intermolecular interactions between nsp14 and nspIO. Zinc ions are represented as gray spheres in A-C. (A) The Nsp14 ExoN domain (green) is 
stabilized by nspIO (red). Two regions of nspIO (boxed) contribute major interactions with nsp14. Zn1 nsp1 °, the first zinc ion of nspIO. (B and C) Interaction 
details by regions 1 (B) and 2 (C). Hydrogen bonds between residues are shown by dashed lines. Residues of nsp14 and nspIO involved in interaction are 
displayed as green and red sticks, respectively. (D) Schematic representation of the contacts between nspIO and nsp14. 
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Fig. 4. Structure and methyl transfer mechanism of the nsp14 N7-MTase 
domain. (A) Cartoon representation of the N7-MTase domain marked with 
secondary structural elements. ( B ) Amino acids within 4 A of the ligands SAH 
(blue) and GpppA (red) are labeled and numbered. The magenta arrow in¬ 
dicates the methyl transfer. Dashed lines between residues indicate hydrogen 
bonds. Trp385, Asn386, Phe401, and Phe506 are shown by dashed bonds to 
depict their position below the plane of ligand GpppA. (C) The ability of nsp14 
to methylate N7 of guanine of GpppA-RNA was measured. The results depict 
the efficiency of the conversion of substrate to product (%) and are plotted as 
a bar graph. WT nsp14 and its mutants were complexed with nsplO. 


accessible areas of nsplO for contacting nspl4 and nspl6 are 2,236 
A 2 and 938 A 2 , respectively. Because nsplO is encoded in about 
three- to sixfold excess over nspl4 and nspl6, the nspl4-nspl0 
complex and nspl6-nspl0 complex can exist simultaneously. 

Structure of the nsp14 N7-MTase Domain. The N7-MTase domain of 
nspl4 contains an atypical MTase fold (Fig. 44). The central (3-sheet 
is made up of five (3-strands instead of seven; the canonical (33 and (36 
stands are missing. Among these strands, (32', pi', p3', and p4' are 
positioned parallel, whereas p8' runs antiparallel. Contrarily, a 
seldom-observed insertion of a three-stranded antiparallel p-sheet 
between canonical strands p5 and p6 of the central sheet is observed 
in nspl4 (Fig. 44). The presence of such a sheet has been noted in 
the structure of the N7-MTase from Vaccinia virus (PDB ID code 
2VDW). This small p2-sheet is positioned almost perpendicular to 
the central pi-sheet. A cavity between sheets pi and (32 functions as 
a ligand-binding pocket. Two small helices are embedded in the 
connecting loops of the pl-sheet, and a lone a-helix, al', is stacked 
against the opposite face of the central pl-sheet. Behind this a-helix 
lies another long a-helix, a2'. The third zinc finger of nspl4 formed 
by Cys452, Cys477, Cys484, and His487 is located at the tip of this 
helix and protrudes from the protein at its C terminal (Figs. IB and 
44). Intriguingly, an additional a-helix, a3' spanning the last 12 


amino acids (Thr516-Gly527) is observed stabilizing the local hy¬ 
drophobic environment (Fig. S5). Such a modification at the C ter¬ 
minus has been observed previously in the nucleic acid-binding SAM- 
dependent MTases (40). Truncation of this region has been shown to 
attenuate greatly or abolish the N7-MTase activity of nspl4 (20). 

Structural Basis for Methylation of N7 of Guanine by nsp14. In 

comparing the structures of unliganded, SAM-bound, and SAH- 
GpppA-bound nspl4-nspl0 complexes, no significant structure 
movement is observed, suggesting that the ligand-binding sites 
are preformed. The ligands bind in a pocket surrounded by 
sheets pi and p2 and helix al' (Fig. 44 and Fig. S6). 

The methyl donor SAM sits above the central pi sheet in the 
space between the loops connecting strands p2' with p3' and p3' 
with p4'. The purine ring is bound in a hydrophobic environment 
created by amino acids such as Ile332, Phe367, and Val389 (Fig. 
4 B). The 061 and 062 of Asp352 form hydrogen bonds with the 
03' and 02' atoms of the ribose, respectively (Fig. 4 B). Substitution 
of Asp352 with alanine reduces the N7-MTase activity of nspl4 by 
20% (Fig. 4C). Asp331 and Gly333, at the end of pi', which pre¬ 
viously were shown to be essential for the N7-MTase activity of 
nspl4 (20), are observed in proximity to the carboxyl group of 
SAM. The carbonyl oxygen of Gly333 forms a hydrogen bond with 
the amino nitrogen of methionine of SAM. Not surprisingly, a 
D331A/G333A double mutation completely abolished the N7- 
MTase activity (Fig. 4C). The Trp292 at the N-terminal loop of N7- 
MTase (Lys288-Asp301) stacks against the ribose for optimal po¬ 
sitioning of SAM for catalysis. Disruption of the interaction of 
Trp292 with ribose reduced the N7-MTase activity by more than 
50% (Fig. 4 B and C). These results are consistent with those 
reported for the methyltransferase from the Encephalitozoon 
cuniculi (Ecml), in which mutations disrupting the interaction of 
the N7-MTase with ribose and methionine moieties of SAM im¬ 
pair activity greatly or completely, but those affecting interactions 
with adenine are not important for activity (41). 

Methyl receptor GpppA binds near SAM. Side chains of Phe401, 
Tyr420, Phe426, Thr428, and Phe506 entrench and hold the purine 
moiety of guanosine in position (Fig. 4 B). Among these amino acids, 
Phe426 showed the largest influence on the N7-MTase activity, and 
F426A mutation reduced MTase activity by 50% (Fig. 4C). Asn386 is 
located in immediate proximity to the atoms involved in methyl 
transfer and forms two hydrogen bonds with the guanine moiety to 
help orient it during catalysis. An N386A mutant abolished more 
than 50% of the N7-MTase activity (Fig. 4C). The 06 atom of 
Asn422 is observed to form hydrogen bonds with the 03' atom of the 
guanosine ribose moiety. Mutating it to alanine does not significantly 
impact the N7-MTase activity. Asn306, Arg310, and Lys336 con¬ 
tribute the positive potential for binding the triphosphate moiety. In 
particular, N62 of Asn306 as well as Nr|l and Nr|2 of Arg310 interact 
with the second phosphate group (Fig. 4 B). The importance of these 
side chains in catalysis is underscored by the fact that the R310A 
mutant retained only 20% of the activity, whereas the K336A mutant 
retained 50% (Fig. 4C). Last, Trp385 stacks against the adenosine 
moiety of GpppA, and a W385A mutation reduced the activity by 
more than one third (Fig. 4 B and C). 

The mutations R310A, P335A, K336A, D352A, and Y420A have 
been reported; the changes in the activity of these mutants are 
consistent with our results, except that Y420A was reported to be 
inactive (28), but we found that the L419A/Y420A double mutant 
retained 70% activity. We believe our results to be reasonable, 
based on the location of these two residues. 

Thus, the ligands are held in a highly constricted pocket, en¬ 
suring that the methyl donor SAM and acceptor GpppA are held 
in close proximity. By enforcing charge and shape complemen¬ 
tarity, the ligands are oriented in the pocket so that the methyl 
group of SAM is brought in the vicinity of the N7 of guanine to 
realize the methyl transfer using an in-line mechanism, as pre¬ 
viously proposed for Ecml (41). 
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Intriguingly, zinc finger 3 of nspl4 located in the N7-MTase 
domain is spatially isolated from the active site, ruling out a direct 
role for this motif in catalysis (Fig. 44). Consistent with this ob¬ 
servation, C452A and H487R mutations of zinc finger 3 had only a 
marginal effect on the MTase activity (Fig. 4C). The peripheral 
location of zinc finger 3 may be more suited to forging protein- 
protein interactions, for example binding with the nspl6-nspl0 
complex to accomplish the second methyl transfer reaction essential 
for formation of capl structure. 

Raw TLC data of the N7-MTase activity assays are included as 
Fig. S7. 

The Interactions Between the Two Naturally Fused Domains. The 

ExoN and MTase domains of nspl4 are interlinked, and they in¬ 
teract using hydrophobic interactions (Fig. S8). Three a-helices of 
ExoN stabilize the N-terminal loop (Lys288-Asp301) and the 
(32-sheet of N7-MTase domain, the base of the substrate binding 
pocket of N7-MTase. Ile80 and Val83 of al, Leul77 of a3, and 
Val282 and Phe286 of a6 form hydrophobic interactions with 
Val294, Tyr296, Pro297, Ile299, Leu411, Pro412, and Leu419. All 
these amino acids are highly conserved within members of the 
coronavirus genus (Fig. S3), highlighting an important conserved 
role for these residues (28). 

Model for the Role of nsp14 in Proofreading and mRNA Capping. To 

envision how nspl4 could participate in the functioning of the 
replication and transcription complex (RTC), we attempted to infer 
the interaction mode between nspl4 and nspl2, the RdRp. The 
polymerization and proofreading sites of DNA polymerases are 
located on different domains or subunits that are typically separated 
by a distance of 30 A or more (42). In contrast, during transcription, 
RNA polymerases carry out both these functions using the same 
active sites (43). In this context, nspl4 is functionally and structur¬ 
ally more similar to the proofreading exonuclease domain of DNA 
polymerases. Using the Imown structure of the Klenow fragment of 
DNA polymerase I (PDB ID code 1KLN) as a guide, the ExoN 
domain of nspl4 was placed next to the polymerization site (Fig. 5). 
During instances of mismatch of nucleotides, the nascent strand is 
moved to the ExoN site for excision. As the newly synthesized 
strand is extended, a capO structure is first assembled at the 5' end. 
This assembly would require the enzymatic activities of a RTPase 
contributed by nspl3 (24), a guanylyl transferase, which is yet to be 
identified, and the N7-MTase activity of nspl4. Last, the 2 -0- 
MTase activity of nspl6 would result in the formation of a capl 
structure at the 5' end of the newly synthesized RNA. Such a model 
accounts for nspl3 functioning as a helicase for unwinding double- 
stranded nucleic acids and the direct nspl3-nspl2 interactions (44, 
45). Further experiments are necessary to shed more light on the 
relative orientation of the various active sites of the RTC and how 
the RNA is maneuvered for formation of the capl structure. 

Conclusions 

The structures of SARS-CoV nspl4 in complex with nsplO de¬ 
scribed here reveal that the core structural elements and catalytic 
residues of the proofreading ExoN domain of nspl4 are highly 
similar to those of its eukaryotic and prokaryotic predecessors. 
Furthermore, nspl4 of SARS-CoV has gained some striking and 
remarkable structural features to accomplish additional tasks such 
as the methylation of N7 of guanine of cap precursors. Our studies 
unveil several of these nsp 14-specific features. First, nsplO contacts 
the nsp 14 ExoN domain to provide structural support and facilitate 
its ExoN activity for proofreading. Second, two zinc fingers are 
located on the ExoN domain. Our mutagenesis studies show that 
both zinc fingers are essential for the nspl4 function. Third, the N7- 
MTase domain of nspl4 adopts an atypical MTase fold and ropes in 
a second (3-sheet to bind the substrates in a highly constricted cavity 
to accomplish methyl transfer. This N7-MTase domain carries 
a third zinc finger that is located remote from the active sites. 


nsplO 



RNA synthesis and proofreading mRNA capping 


Fig. 5. Model for the role of nsp14 in proofreading and mRNA capping. Nascent 
RNA is synthesized at the polymerization sites (red circle) of the nsp12 RdRp do¬ 
main (gray) or is mismatch excised at the proofreading site (red circles) of nsp14 
ExoN (green surface). The relative orientation of the polymerization and proof¬ 
reading domains is built based on the Klenow fragment. The 5' end of newly 
synthesized mRNA is modified by sequential activities contributed by RTPase of 
nsp13, GTase (currently unknown), N7-MTase of nsp14, and 2-O-MTase of nsp16 
(light pink surface, PDB ID code 3R24) for formation of a capl structure. 

Combined with previous functional data, this work provides evidence 
for the likely mechanism underlying the involvement of nspl4 in viral 
replication and transcription. 

Materials and Methods 

Protein Production. Full-length nsplO and nsp14 of SARS-CoV isolate Tor2/FP1- 
10851 were synthesized by GENEWIZ Inc. and cloned into pGEX-6p-1 and 
pRSFDuet-1, respectively. NsplO with an N-terminal GST tag and nsp14 with a 
C-terminal His tag were coexpressed in E. coli BL21 (DE3). Cells were grown at 
37 °C and induced with 0.1 mM isopropyl p-D-thiogalactoside when OD 60 o n m 
reached 0.8. After induction at 16 °C for 18 h, cells were harvested, resuspended, 
and lysed by sonication in 20 mM Hepes (pH 7.0), 150 mM NaCI, 4 mM MgCI 2 , and 
5% (vol/vol) glycerol. After centrifugation, recombinant protein was purified by 
GST-affinity chromatography. The GST tag of nsplO was removed by PreScission 
protease. Excess nsplO was separated from the nsp14msp10 complex by HiTrap 
S ion-exchange chromatography (GE Healthcare). 

Mutagenesis was performed using the Easy Mutagenesis System (Transgen 
Biotech). All nsp14 mutants were coexpressed with nsplO and purified as 
described above. 

Nsp14 alone was purified using Ni-nitrilotriacetate affinity resin (GE 
Healthcare) with a gradient of imidazole for washing and elution. 

Crystallization. Crystallization was performed by the hanging-drop vapor- 
diffusion method at 16 °C. Each crystallization drop consisted of 1 |.iL of 
nsp14-nsp10 complex (10 mg/mL) mixed with 10 mM DTT and 0.25 mM SAM 
and 1 \il of the mother liquor equilibrated over 200 |jL of reservoir solution. 
Diffraction-quality crystals grew in 15% (vol/vol) Tacsimate (pH 7.0), 0.1 M 
Hepes (pH 7.0), and 2% (wt/vol) polyethylene glycol 3,350 after optimization 
by microseeding. Diffraction was further improved by dehydration in 25% 
glycerol for 3 min. To obtain a complex of nsp14-nsp10 with SAM and 
GpppA, crystals were soaked with 50 mM GpppA (New England Biolabs). 

Data Collection and Structure Determination. X-ray diffraction data were col¬ 
lected at Shanghai Synchrotron Radiation Facility beamlines BL19U and 17B. All 
data were processed with the program HKL3000 (46). The initial phases were 
calculated by the single-wavelength anomalous dispersion method using the 
anomalous signal of zinc atoms with the program PHENlX.autosol (47). A total 
of 10 zinc atoms were found using HYSS (48). After phasing, twofold non¬ 
crystal lographic symmetry was identified and used in subsequent density mod¬ 
ification, dramatically improving the phases of diffraction data. Two molecules 
of nsplO (PDB ID code 2G9T) were docked into the map using MOLREP (49). 
Other parts of the complex were manually built in COOT (50) and iteratively 
refined in PHENlX.refine (51). To solve the structure of the unliganded nsp14- 
nsplO complex or nsp14-nsp10-SAH-GpppA complex, molecular replacement 
was done first in MOLREP using the structure of the nsp14-nsp10-SAM complex 
as the search model, and then the structures were refined in PHENlX.refine. Data 
collection and refinement statistics are listed in Table SI. 
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Exoribonudease Activity Assay. ssRNA made up of 22 nucleotides (RNA22, 
5 , -GGGCGAUUAGGAGCUAACUGCG-3 , ) was used as a substrate for activity as¬ 
says (16). To obtain 5'-labeled RNA22, it was incubated with T4 polynucleotide 
kinase (New England Biolabs) and y-[ 32 P]-ATP (PerkinElmer). MicroSpin G-25 
columns (GE Healthcare) were used to remove excess y-[ 32 P]-ATP. Later, RNA22 
was extracted with phenol-chloroform and precipitated with ethanol. 

Reaction mixtures contained 300 nM nsp14msp10 complex (or nsp14 mutant- 
nspIO complex or nsp14 alone), 1,000-cpm labeled RNA22, and 300 nM unlabeled 
RNA22 in a buffer made up of 50 mM Hepes (pH 7.0), 50 mM NaCI, 5 mM MgCI 2 , 
and 1 mM DTT. After incubation at 37 °C for 5 or 30 min, the reactions were 
stopped by the addition of an equal volume of loading buffer (96% formamide 
with 10 mM EDTA). Products were separated on 20% 7-M urea-containing 
polyacrylamide gels and visualized through Phosphorlmager. 

MTase Activity Assay. DNA fragments including the optimized T7 class II pro¬ 
moter 02.5 with ATP as initial nucleotide (52) and the 5'-terminal 259 nucleo¬ 
tides of the SARS-CoV genome (20) were used as a template for in vitro 
transcription. We then used the Vaccinia capping system (except for SAM) (New 
England Biolabs) and a-[ 32 P]-GTP to label the 5' terminus of the RNA to G*pppA- 
RNA. To improve the efficiency of the reaction, 0.05 U inorganic pyrophosphatase 
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(New England Biolabs) was added. G-50 Sephadex columns (Roche) were used to 
remove unused a-[ 32 P]-GTP. RNA was extracted with phenol-chloroform and 
precipitated with ethanol. 

MTase activity was tested as follows: 0.1 pg nsp14-nsp10 complex or nsp14 
mutant-nspIO complex was mixed with 1,000-cpm labeled RNA in a buffer 
made up of 50 mM Hepes (pH 7.0), 6 mM KCI, 5 mM DTT, 1 mM MgCI 2 , and 
0.2 mM SAM. After incubation at 37 °C for 6 min, 5 pg nuclease PI (Sigma) and 
1 mM ZnCI 2 were added to digest the RNA. Reaction products were spotted on 
polyethylenimine cellulose plates (Merck) to separate G*pppA from capped 
m7G*pppA and were visualized using a Phosphorlmager. 

The marker m7G*pppA was prepared as above, except that inorganic 
pyrophosphatase was replaced by 0.2 mM SAM. 
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